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ABSTRACT 

A year long study involving 38 students in grades 1 
to 6 was conducted to assess the degree of implementation of a . 
frequent, curriculum-based measurement and evaluation system in 
classrooms in 'which the teachers had received training in the system, 
and to examine the effectiveness of the measurement and evaluation 
-system in terms of enhancing the structure of the instructional *>- 
lessons and /student s ' reading achievement. The results indicated that 
although teachers weref skillful in the measurement part of the 
system, they were unsuccessful in^aplsflrtTig the evaluation components- 
students/ instructional program^ seld^mwere changed. In terms of the 
structure of the lessons, onry one of th^v^ structure variables 
(controlled practice) yielded signif icantly^hrigher ratings for 
experimental than for control subjects. The remaining 11 variables 
favored experimental subjects, but were not statistically 
significant. No statist icaUy significant differences in achievement 
were found between the two groups. All students improved over time. 
The results suggested that the implementation of a frequent 
curriculum-based measurement system is > feasible and successful in 
improving the structure of instruction. Achievement effects may be 
manifest if the evaluation components are applied. (Author) 
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Abstract 

A year long .study involving 38 students was conducted to (a) 
assess the degree of implementat iqn of a frequent, curriculum-based 
measurement and evaluation system in classrooms in .which the teachers 
• had received training in the system, and (b) examine the effectiveness 
of the: measurement and evaluation system in terms of enhancing the 
structure of the instructional lessons and -students' readinq 
achievement. .^The results indicated that^ although teachers were 
skillful in the, measurement part of the system, they were .unsuccessful 
in applying the evaluation components; students' instructional 
programs seldom were chanqed. In terms of the 'structure of the 
lessons, only one of the 12 structure variables' (control led practice) 
'yielded significantly higher ratings for experimental than for control 
subjects. " The remaining 11 variables favored ^experimental subjects, 
but were not statistically significant. No statistically significant 
differences in achievement were found between the two groups. All 
students improved over time. The results suggested that the 
implementation of a frequent curriculum-based measurement system is 
feasible and successful in improving* the structure l of instruction. 
■ Achievement effects may be manifest if the' evaluation components are 
applied.' \ 
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The Effects of Training Teachers in the Use of Formative Evaluation 
in Reading: An Experimental-Control Comparison 

In recent years, with • the advent of Public Law 94-142 and 
i ncreased public pressure for account ab i 1 i ty in educat i on , qreater 
demands have been placed on educators, especially special educators, 
to be accountable for the quality pf instructional decision^ and the 
ways in which they are made. Recent evidence (White & Haring, 1980) 
suggests that formative evaluation systems may .provide viable 
alternatives to the traditional pre and post testing approach to 
evaluation of academic "programs. Such systems provide continuous 
-feedback to"both r the teacher and student, allowing educators to more 
closely monitor academic progress. 

During the past five years, the Institute for Research on 
Learning Disabilities at the University " of Minnesota, under federal 
contract, has conducted a number of studies that focused on, developing 
and monitoring progress on I EP goals, as i§ intended in PL .94-14?. 

The goal of this research has been to determine empirically the 

" * 

effects of using formative evaluation techniques on student 
achievement in. reading,, spell ing, and written expression. 

Earlier research in this area determined what measures of student 
performance would be Ideal .for use in a formative evaluation system. 
The search for these -measures hegan by generating a list of desired 
characteristics, such as ease of administration, time efficiency, and 
sensitivity to growth^ over time (Jenkjns, Deno,* & Mirkin, 1979). The 
measures that were not reliable or valid,, or those that were deemed 
less suitable with respect to any of the other desired 
characteristics, were eliminated from consideration. * 



Five reading behavi.ors were generated from a. review of the 
literature and placed in the original pool for consideration. A 
series of cr i terion' val idjity studies (Deno, MiM<in, Chiang,, & Lowry, 
1980), showed that reading aloud from a basal reader, reading" aloud 

from lists of isolated words,' and guessing the words deleted from a 

• * 

reading passage (i.e., cloze comprehension) all related closely to 

performance on standardised tests .and discriminated between program 

i 

and grade placement. Such formative Measures 'of reading have also 
shown high test-retest (r = .90) and alternate forms (rs = .89 - .92) 
reliability (Shfnn, 1981). • ' r 

Related studies focused on determining the optimal duration of 
reading measurement and the type of data to record. Results from 
testing over one, two, and three-minute durations indicated that 
reading proficiency can be indexed val idly within one minute and that 

correct performance is a more valid measure of readinq proficiency 

4 

than error performance (Deno 'et al., 1980). 

Previous studies also, assessed the sensitivity jpf two reading 
measures," reading isolated word lists and reading aloud from a basal ^ 
reader. Both reading measures were found to be sensitive to changes 
within each grade level from fall to spring and across grade levels 
(Marston et* al., 1981). However, reading aloud from a basal reader 
was chosen as the optimal generic measure in reading because it 
produced a broader range of -scores than isolated words, related 
somewhat more closely to comprehension, and required . 1 Utle teacher 
preparation. 1 : 

Given that one-minute timed samples of reading from the 



curriculum have bedn shown /to be reliable and valid measures of 
reading growth, there remained the naed to test the practicality of 
such measures and the effects that teacher use of such measures might 
have on student achievement over time. Specific questions related to 
these issues were posed in the current study. 

First, can teachers luarn to use the measurement system and will 
they find it practical and time-efficient? Once the measurement 
system is implemented, will teachers use the information it provides 
to more closely monitor and change the educational program of the 
student? One of the major advantages of such a system is that it 
allows for continuous evaluation of the instructional program. Thus, 
it is critical that the information provided by the system be 'used. 

Only if these questions are answered affirmatively is it possible 
to examine the questions concerning the effectiveness of the system. 
Two questions' were investigated concerning the^ efficacy of the 
measures.. First, will the use of such measures have ,an effect on the 
structure of the learning environment provided to the student? 
Because a formative evaluation system provides continuous information 
about the need for program changes, one might expect the use of such a 
system to result in a more highly structured learning environment. 
Second, given that teachers can learn to use such, a formative 
evaluation system for both measurement and evaluation, will the use of 
such procedures have a direct effect on student achievement? One 
would ex'pect that frequent modifications in 'the instructional plan 
made possible by continuous feedback would lead to an educational 
program more sensitive to individual needs and thus more conducive to 
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growth in reading. 

Method 

Subjects 

A total of 38 elementary students in grades 1-6 participated in 
the 'study. ' (See Table i for complete breakdown of subjects by grade 
and sex.) Thirty-two of the students were' male' (84.2%) and six were 
female (15. 8%) . Students were assigned to either the experimental 
(treatment) or control (no treatment) conditions for comparison 
purposes. Data were obtained on % 19 students in each condition. ' 

, *-» 

Insert Table 1 about he ( re 



Procedures 

* •» 

Designated trainers from a large midwest suburban school district 

participated fn a ful 1-day .workshop before the beginning, of the school 

year. Principally, training focused on the use'of the measurement and 

evaluation procedures as prescribed in the IRLD manual entitled 

Procedures to Develop and Monitor Progress on IEP Goals (Mirk in, Deno, 

Fuchs, Wesson, Tindal , Marston, & Kuehnle, 1981). Subsequent to this 

workshop, the trainers trained the teacher participants in the use of 

the measurement and evaluation procedures. 

Daily measurement cons is ted of one-minute timed samples of 

reading' from' the 'Student's 1 curriculum. Both words ..correct^ and 

incorrect were scored and graphed on equal interval charts. Based on 

the results of previous research '(Fuchs % & Deno, 1981), the placement 

level for testing, which also became the baseline, was set at a 



criteria of 20-29 words per minute for gr- ar,e s 1 and 2, and* 30-39 words 
per minute for grades 3 through 6. 

Teachers were instructed to write^EP long-range goals using both 
the entry level criteria and a desired year-end ■ mastery criteria, 
usually 70 words correct per minute with no more than 7 errors. 

Short-term objectives were based on the long-range goals (LRG). 
In order to compute the short-term objective, teachers first 
subtracted the baseline level of performance from the criterion level 
listed in the LRG. Dividing this difference by the number of weeks 
necessary until the annual review, they arrived at the number of words 
per week gain necessary,, to meet the long-range goal criteria. 

In order to monitor student growth, the baseline reading level 
and the long-range goal were connected by an aimline that showed the 
students 1 desired progress. Every seven data points, the teachers 
were to monitor student growth by means of the tspLvt-middle or 
quarter-intersect method (White & Liberty; 1976). If the student was 
progressing at a rate equivalent to or greater than : that indicated by 
the aimline, the instructional program was continued; if the projected 
rate of growth was less than that indicated by the aimline, teachers 
were directed to make a substantial change in the student's program. 
M easures 

Four measures were used in collecting data: one each for 
implementation and structure, and two for achievement. The structure 
of the learning environment was assessed by means of the Structure of 
Instruction Rating Scale for both experimental and control subjects 
(Deno, King, Skiba, Sevcik, & Wesson, 1983). Degree of implementation 

. .10 



of the continuous evaluation measures— the treatment. for/, the- 
experimental subjects -was assessed using , the Accuracy -.of 
Implementation Rating Scale. Achievement measures "for both 
experimental and control groups consisted of , timed samples from three 
third grade passages, and subtests of the Stanford Diagnostic Reading., 
Test (SDRT). The three timed samples were col lected "three times 
durinq the year. The Stanford Diagnostic Reading lest was 
administered only once, in May, to both experimental and control 
subjects. Descriptions of the measures follow. 

Structure of instrCiction rating scale . . The Structure of 
Instruction Rating Scale (SIRS) was designed to measure the degree of 
structure of the instructional lesson that -a student received, in "this 
case in reading. The variables chosen for inclusion on the SIRS were- 
gathered from current literature on instruction and student academic 
achievement (cf. Stevens & Rcsenshine, 1981). * 

The SIRS consists of 12 five-point rating .scales in which a rating 
of 1 is' low for '-the variablevand 5 is. high. The rel iabi 1 i ty of the 
SIRS was assessed by means of Coefficient Alpha, a measure of internal 
consistency. For a sample of 70 students observed in November, the 
average inter-item correlation was .37, resulting in an alpha of .86. 
Thus, the SIRS seems to have a high degree of reliability as indexed 
by measures of homogeneity. ' + 

Factor analysis of the 12 variables'^Dn the SIRS revealed that 9 of 
the 12 represented one factor. Three var iables — Independent Practice, 
Positive Consequences, and Silent Practice on Outcome Behavior—were 
not measuring the same, factor."' thus, the n.ine, variables were utilized 
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in the data analyses as one factor and the other three variables were 
analyzed separately. 

Accuracy of implementation ratine) scale , ThQ Accuracy of 
Implementation Rating Scale (AIRS) is an instrument that was developed 
in conjunction with the manual Procedures to Develop and Monitor 
Progress on IEP Goals (Mirkin et>al., 1981). The AIRS is designed to 
provide a format by which to rnQnitor* the implementation of the 
procedures described in the^manual.. The AIRS consists of 12 items 
that are rated on a 1 to 5 scale, 1 being the lowest implementation 
score and 5 being "complete and. accurate implementat ion. 

Parts of the scale require direct observation whereas other items 
on the checklist can be* monitored by inspection of student reading 
graphs and by' reading IEP forms. Items 1 and 2 * of the AIRS', which 
require direct, observation, deal With the accuracy of administration 
of the measurement system and selection of. the stimulus materials. 

-For jtems '3-12 of the AIRS, research* assistants inspected various 
written documents and made the ratings. Specifically, the IRLD rater 

, examined the following documents for each student: (a) the IEP, which 
should specify the . long-range goal * and short-term objective in 
reading; (b) the reading graph; (c) the instructional plan for 
^reading; and (d) the record of changes made 'in the instructional plan 

' in rea'ding. Factors included in Jtems 3-12 pertain to the 
establishment- of the appropriate measurement level', an adequate 
baseline, an accurate long-range goal and short-term' objective, a 

. detailed graph, a. complete instructional, program,* and a correct 
aimline. These items also- included the timing of instructional 




changes and the types of changes made. Frequent checks among the four 
research assistants ^rating the accuracy of implementation assured high 
inter-rater agreement. Reliability of the AIRS was' assessed by means 
of the C'ronbach's Alpha internal consistency measure. The average, 
inter-item correlation was .12, resulting in an alpha of .62. 

/ , Results / - 

Implementati on (AIRS) . * ' 

The mean raw score ratings for each variable on the AIRS for each 
round of data collection for the experimental students are reported in 
Table 2. -As mentioned previously, variables were rated on a 1 (low) 
to 5 (high) rating scale.- Ratings were assessed by IRLD staff for all 
of. the AIRS variables except^ variables 1 and 2, which were scored by 
district observeFs/trai ners . 



..Insert Table 2 about here 



The data strongly indicate that during all three rounds of data 
collection, teachers consistently were able to employ the initial 
measurement procedures, e.g., administering the measurement task, 
selecting the stimulus materials, obtaining a baseline measure of 
performance, labeling the graph appropriately, determining .short term 
and long range objectives, as well as determining the aimline based on 
a formula outlined \r\ the manual . Procedures for Developing and 
Monitoring Progress on IEP Goals (Mirkin et al., 1981). Areas where 
teacher implementation scores could be higher involved evaluation and 
utilization of the data on an on-going basis. ^Tor example, Timing of 
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In j triirfTTWrrrK Changr*. 9 t ?nt 1 ?1 Changes, and Clear Changes were 
rated considerably lower than the other AIRS" variables previously 
discussed. For this sample, only 10 changes in the instructional plan 
were ..recorded over a five-month ■ per iod.. In ' general ,. teachers seldom, 
changed the instructional plan apce* iV vjgs established. '.' - 
Structure of Instruction (SIRS) , s 

The mean ratings for. each • variable, and Jt/ values for - the 
experimental and control group comparisons are reported in Tables 3, 
.4"; and '5. The data indicated that nine . variables' consistently stayed' 
together across the three time conditions; these emerged as a separate 
factor. The moderate to high ratings Vn these variables at all three 
points in time suggest that these aspects of classroom structure are 
fairly stable ,and present in the classrooms. . Interestingly*, 
statistically significant, differences between experimental' and control 
groups were recorded for the variable Controlled Practice across all; 
three times. At Time One, the control students received significantly 
higher ratings on this variable than the' experimental students. 
However, for both Time Two and Jime Three, the. higher mean ratings for 
the experimental students were statistically ■ significant. For both 
groups of* students, the .variables Independent Practice, Positive 
Consequences, and Silent Practice on Outcome Behavior were rated 
considerably lower than the other variables constituting classroom 
structure. 



Insert Tables /3-5 about here 
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Controlled Practice was J:he only variable for which differences 
in sample, means between the control and experimental student^ .was 
statistically significant. At Time* Three, the samp1e*mean*:rat/fnqs for 
p of the 12 SIRS variables were^Wgher for- the experimental students, 
but none of the differences'' was statistically reliable. , . 
Achievement v r ' 

Data* on t the nufnber of correct words read per minute on each of 

three reading passages are reported in Table, 6. Data Tncluded-Jn^the 

»»■... . ✓ " *» 

table were standardized to z_ scores using data from students in three 

additional research" 'sites. Using a large negative' sample to 

-standardize scores* increases^the validity of the data an d : adjusts for 

the relatively low frequency of cases* reported* at grades 1 and 2 and. 

6. .For ease of ^presentlt ioV>, these \ scores have been tr^hsformed 

into .t scores using a standard « deviation of 10 and a mean of- 100.. 

Results from an analysis of variance (see Table 7) indicated that all. 

studertts, on the'average, showed growth over time. However, the gains 

for the experimental students were not significantly different from 

those for the control students. 



Insert Tables 6 and 7 about here 




\ ■ Raw score data for the various subtests on. the Stanford 
Diagnostic . Reading Test (SDRT) for both experimental and control 
udents are reported in Table 8. Analyses by t-test comparisons 
revealed no statistically significant differences between, the two 
groups. However, the sample means were slightly higher for the 
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control students on all six SDRT subtests. 



Insert Table 8' about here 



' . , Discussion » 

The present investigation focuses on a number of important 
questions relating- to the practicality and effects of direct arfd 
frequent monitoring of progress on IEP goals. Principally, can 
teachers \earn to .use such a measurement system? Additionally, will 
teachers use the information provided by such a' system to make 
frequent changes in the educational plan and monitor the effectiveness 
of those changes? Moreover, will such a system have an 'effect'on the 
stru^. e of reading instruction the student receives and will this be 
related to reading Achievement? 

Data from the present investigation revealed that £tfea<?he.rs can 
learn to effectively administer timed reading sampltesJand accurately 
chart the data' to provide a continuous record of student growth in 
reading. Ratings of trainers on the Accuracy of Implementation Rating 
Scale support this finding. However, data also indicated that the 
teachers only partially used the evaluation component ^ of the data- 
based system. That is, the teachers' use of procedures to evaluate 

student data in order to make on-going changes in the instructional 

A ' 
plan was low; teachers could have made' use of- these procedures : to. a 

considerably greater extent. 

Also, data generated % \o assess the structure of thf^instructional 

lesson revealed tKat^experimental students engaged in significantly 

* * 

■ t 
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more control led practice of. their lessons" than the control' students. 
Moreover, by Time Three, structure variables more often .;sre rated 
higher for the . N experimental students.- This finding suggests that 
teachers who utilized the data based system provided greater -structure 
for the reading lessons. However, some structure variables, were 
consistently rated lower for' both the control- and experimental groups.- 
For example. Positive Consequences rarely was an aspect of the 
classroom setting. Given that many of the- students in this sample had 
difficulties in reading, it is surprising that some form of 
"contingency management or token economy was not used more often as a 
motivator for improving the reading performance of these students. 
Usefulness of Procedures 

At .the end $f the year, teachers who participated in the study 
completed questionnaires^ regarding their reactions to the data based 

r 

program modification procedures.. These data currently \are being 
analyzed as part of a larger study. However, preliminary findings .are 
quite favorable. Moreover, data gathered informal ly . during a 
presentation of findings at the end of the" year suggest that both 
trainers and teachers, for the most part, believed that the system 
•provides an indication of reading progress and growth. 

Although the present study did not support the contention, that 
teachers can use the evaluation system effectively to increase reading 
achievement, the results do demonstrate the feasibility of using such 
a system to monitor progress on lEP goals routinely—a necessary 
component of special education programs (PL 94-142). While teachers 
in the present sample were not successful in using the evaluation 



components of the system, preliminary findings' from a similar, though 
larger scale,- experiment in the i.'aw York City /public schools support 
the'eff icacy*of using such an approach to monitor and* evaluate reading 
progress (Fuchs, Deno,- & Mirkin, 1982). 
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Table 1. 

Grade, Sex, and Age of Students 



■ 




MumDer ot 
. Students 


Percentage 


f 


0 


Grade 




■ 










1 




1 


2. 


6% 






? 




2 


• 5. 


J /o 






3 




6 


15. 


8% 






4 




11 


cO . 


no/ 


• 




f 

5 




1 o 


26. 


3% 






0 


■ 




1 3. 


2% 






'Unknown 






1 . 


97o 








I ota i 


JO 


1 00 . 


0/O 






Sex 


• 












Male 




32 


84. 


2% . 






Female 




_6 


15. 


,8% . 


• 






Tot^l 


.38 


100. 


,0% 






Age (yrs) 














7 . ■ 




2 


5, 


.3% 






8-' 




3 


7, 


.9% 






.9 




4" 


10, 


.5% 






10 




11 


28, 


.9% 






11 




9 


23, 


. 7* 






12 . : 




5 


13, 


.2% 






13 




1 


2 


.6% 






Unknown 




>! _3 * 


7 


.9% 








Total 


38 


100 


.0% 
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Table 2 

Mean Scores on the Accuracy of Implementation 
Rating Scale (AIRS) a 





Time 1 


Time 2 


Time 3 



1 . 


Administering the Measurement 'Task 


4.00 


4. 


76 


4 


.45 


2. 


Selecting the Stimulus Material 


3.86 


4. 


76 


4 


.45 


3. 


Sampling for . Instructional Level 


4.00 


3. 


35 


" 3 


.33 


4. 


Baseline • 1 


4.29 


3. 


68 ' 


. 3 


.86 


5. 


Graph Set-up 


.4.00 


3. 


82 


3 


.93 


6. 


Afml i ne \ 


4.47 


4. 


94 « 


4 


.60 


7. 


Timing of Instructional Changes 


0.00 


3. 


23 


3 


.00 


8. 


Long-Range Goal 


4.70 . 


4. 


82 


' 4 


.73 


9. 


Short-Term Objective 


.3.70 


3. 


94 


3 


.33 


10. 


Instructional Plan 


3.80 


3. 


11 


3 


.53 


11 . 


Substantial Changes 


0.00 


2. 


25, 


Z 


.00 


12. 


Clear Change 


0.00 


3. 


50 


1 


.25 



— 1 t ' : : 

a Data are for experimental subjects only (N=19) . Rating scale: 
l=low, 5=high. 



Table 3 

Mean Scores on the Structure of Instruction Rating Scale 
(SIRS) and t Test Results for Time 1 

V 





- 


Mean 




Separate Error 
t . df 2 


Variance 
-tail Prob 


T n«; trurt i nna 1 HrouDino 


E ' 
C 


4.38 
4.06 


1 , 


.15 


26.45 


.26 


Teac her-di rected Learning 


E 
C 


4.11 
4.13 




.07 


30.42 


'.95 


Art i vp Acadpmic Re^nandina 


E 
C 


4.27 
4.20 




.28 


30.53 


.78 


'/'■ 

Dpmnn <;tra XA on / PromDt 1 n a 


E 
C 


3.83 
. 4.00 


- 


.51 


27.81 


.62 • 


u ui 1 l r u i i cu r i at l i l. c 


E 
C 


3.44 
4.21 


-2 


.80 


29.90 


.01* 


rrcuuciiL.jr u i wui i c^i 

Answers 


■ E 
C 


4.16 
4.33 




.77 


29.96 


.45 


Independent Practice 


E 
C 


2.33 
3.28 


-1 


.28 


9.11 


.23 


Corrections 


"E 
C 


4.11 
4.13 




.07 


27.25 


.94 


Positive Corisequences 


E 
C 


2.ff6 
2-/33 




.73 


30.23 


.47 


Pacing 

: — 


E 
C 


4.27 
4.13 




.43 


30.79 


.67 


v. 















23 
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Table. 4 

Mean Scores on the Structure of .Instruction- Rating Scale 
(SIRS) and t Test Results for Time 2 



Separate Error Variance 
Mean .t df 2-tail Prob 



1 

Instructional Grouping 


E 

C 


3.94 
3.55. 


v 95 


34. 


60 


.35 


Teacher-directed Learning 


E 

C - 


4 . 00 
3.77 


.80 


34. 


43 


.43 


Active Academic Responding 


E 
C 


4.31 
4.00 


1.13 


' 34. 


00 


ff 

.27 


Demonstration^ Prompting 


E 
C 


3.94 
3.76 


.62 


32. 


00 


,54 


Controlled Practice 


E 
C 


3.94 
3.25 


2.28 


29. 


16 


.03* 


Frequency of 'Correct 
Answers 


E 
C 


4.15 
4.27 


-.51 


34. 


29 


-.62 


Independent Practice 


E 
C 


1 .87 
2.50 


-.76 




62 


.47 ' 


Corrections 


E 
C 


4.21 
3.66 


1 .80 


31. 


84 


.08 


Positive Consequences 


E ■ 
C 


2.78 
2.22 


1 .23 


34. 


56 


.23 


Pacing 


E 
C 


3.94 
3.66 


.76 


34. 


53 


•45 


Oral Practice on Outcome 
Behavior 


E 
C 


3.68 
3.38 


'.70 


34. 


75 


.49 


Silent Practice on 
Outcome Behavior 


E 
C 


2.36 
2.77 


-.87 


35. 


00 


.39 



r 



24 
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Table 5 ' 

■'' \ 
Mean Scores on the Structure of Instruction* Rating Scale \ 

i j 



(SIRS), and t Test Results for Time 3 



Separate Error -Variance \ 
Mean t ' df 2-tail Prob\ 



Instructional Grouping - 


E 
C 


4.31 
3.76 


1 


.65 


30. 


87 


.11 


Teacher-directed Leahrning 


E 

C 


4.05 
3.88 ' 




.55 


33. 


.42 


. 58 ' v 


Active Academic Responding 


E 
C 


4.31 
4.11 


• 


.69 


" 33. 


.80 


.50 


Demo nst rat ion/ Prompting 


E 
C 


4.26 ' 
4.05 




.75 


33. 


.34 


.46 


Controlled Practice 


E 

C 


3.89 
3.11 


2 


.17 


28, 


.74 


.04* 


Frequency of Correct 
Answers 


*E 
C 


4.00 
4.05 


- 


.21 


33 


.91 . 


.84 . 


Independent Practice 

r ■ 

Corrections ^ 


E 
C 


2.22 , 
2.66 


-1 


.11 


14 


.62 


.28 


' E • 

•c 


4.10 
3.88 




.65 


31 


.49 


.52 


Positive Consequences 


E 

C 


2.42 
1 .94 


1 


.11 


33 


.96 


.27 


Pacing 


E 

C ' 


4.00 
3.88 




.29 


,33 


.92 


.77 


Oral Practice on Outcome 
Behavior 


E 
C 


3.68 
3.58 




• 

.22 


33 


.95 


.83 


Silent Practice on 
Outcome Behavior 


E 
C 


2.31 
2.47 




.33 


33 


.30 


.74 
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Table 6 , 

.. * ,../* f 

\ m 

") Reading Passage Data: T Score Transformation and Analysis of Variance 

. .„ ■ ■ ■ i 



Reading 
Passage • 


Group 


Time 1 


T Scores 


Time 2 


T Scores 


> 

/ 

Tirge 3 


7 ■ ' 
T Scores 


3 


E 

C 


-.6969 
-.1158 


93.1 
98.9 


I 

-.3398 • 
-.0555 


96.6 
99.9\ 


-.4585 
..-.0561 


95.4 
99/4 


4 


E 

C 


-.7163 
-.3076 


\?2.9 
96.9 


-.2261 
-.0645 


97.7 
99.4 . 


-.4832 
-.1324 


95.2 
98.7 . 


* 

5 .. 


E 
C 


-.6704 
-.3054 


' 93.3 
. 97.0 


-.3074- 
. -.2576 


. 96.9 ' 
97.4 


-.1547 
0841 


98.5 

99.2" • 
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Table 7 

Analysis of Variance Results for 
Reading Passage Data 



Passage 




df 


F 


prob 


3* 




2,21 






Time 






5.16 


.02* 


Tim£ 

4 


X Cond • 

' X 


2,19 


1 .21 


.32 


Time 






7.66 


.00* 


Time 


X Cond 




0.50 


.61 


5 • 




2,19 






Time 






1 . 65 . 


.22 


- Time 


X Cond 




0.21 


.81 
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Table 8 ' 
Mean Raw Scores on Subtests of the Stanford Diagnostic Reading 
(SDRT) and t Tes't Results 





jUf\ 1 jUULci L 




Mean 


Separate J »<Er?TTr-'Variance . 
t df 2-tail Prob 


Word Division 


E 


21. 


,40 












C 


23. 


,30 


-1 .06 


33. 


.14 


.30 ' 


Word Blending 


E 


19, 


.42' 










C 


21 . 


.33 


-.87 


33. 


.69 - 


.39 


Structural Analysis 


E 


40. 


.84 












C 


44. 


.66 


-1.03 


33, 


.72 


.31 


Literal Comprehension 


E 


'21. 


.55 












C 


23. 


.18 


-.59 ' 


26, 


.73 . 


.56 


Inferential Comprehension 


E 


18. 


■P 










C 


20. 


.162 


-.85 


26, 


.74 


.40 


Comprehension Total 


E 


39, 


.83 










C 


43, 


.81 


-.7,4 


26, 


.67 


.47 




\ 
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